# ELEC 3300 Introduction to Embedded Systems # Topic 10 Direct Memory Access (DMA) Prof. Vinod Prasad ## **Expected Outcomes** - On successful completion of this topic, you will be able to - Classify three different I/O modules - Describe the basic operations of the three different I/O modules - Understand the hardware configurations of DMA - Illustrate the operation of DMA block transfer mode # What is the function of Direct Memory Access (DMA)? minimal CPU action DMA provides high-speed data transfers between: - -Peripherals and memory - -Memory and memory Data can be quickly moved by the DMA without any CPU action. This keeps CPU resources free for other operations. ## What is a buffer? #### An area of memory that is used to store temporary data Depending on what the hw subsystem is, the rate at which a hardware event is generated can be unpredictable. For example, if the hardware subsystem is a keypad, the time interval between two keystrokes generally varies. If both the input rate and output rate of the buffer is known, we can easily calculate the size of the buffer such that no overflow or underflow can ever occur. HW typically runs much faster than software without a buffer sw module cannot keep up with defa input from hw module overthan of data #### The Buffer Caution – boundary region of data overflow starts from here. Caution – boundary region of data underflow starts from here. ## Concurrency Between I/O and Programs How to write a block of data from disk to memory? - Three different I/O modules ## I/O Module: Programmed I/O Suppose this is done one-byte at a time under the software control. Issue Read CPU | I/O Command to **Physical View** I/O device e.g. Keyboard, e.g. Cache. mouse, Printer, Read Status I/O □ CPU RAM Executing modem.... of I/O device Not Embedde Buffer Read Frror **Check Status** I/O Conditio **Program** Software Read Device Memor μC or μP Read Word I/O □ CPU from I/O device Increment of I/O and memory Write Word CPU ☐ Memory addresses Into Memory No Applications: Done? Keyboards interfacing • Occasional application to other simple character-based data transfer Yes **Next Instruction** Cons: CPU is heavily involved. ELEC 3300: Fall 2021 ## I/O Module: Interrupt I/O Vinod Prasad ELEC 3300: Fall 2021 ## Increasing Concurrency Between I/O and Programs # I/O Module: Direct Memory Access (DMA) - What if we make the I/O devices a little smarter? - Make the device capable of moving data to/from memory itself - Advantage: it would no longer need the processor to move the data - Overhead: DMA controller and Additional Control logics #### DMA – How - The device can buy memory access cycles from the bus while the processor is not reading or writing. - Generally, a block move is set up, say 512 bytes per block - The processor initiates the block data transfer by memory-mapped I/O - A whole block of data is then transferred without further processor intervention - To do this, the DMA controller takes control of the bus (instead of the processor) to sequentially copy all the bytes in the block - On completion, the DMA controller restores control of the bus to the there can be 1 hit to specify whether it is a memory or 10 derice CPU Memory-mapped I/O: Uses the same address space to address both memory and I/O devices. The memory and registers of the I/O devices are mapped to address values. So when an address is accessed by the CPU, it may refer to a portion of physical RAM, or it can instead refer to memory of the I/O device. Thus, the CPU instructions used to access the memory can also be used for accessing devices. #### **DMA Hardware** BR: the DMA controller requests use of the bus for a DMA transfer #### **DMA Controller** - When inactive, the DMA controller looks like a normal interface (i.e. the internal registers can be accessed by the processor using a normal read/write bus cycle) - But when the DMA controller is granted the bus, it generates its own read/write cycles (i.e. it issues address and control signals for the memory and device interfaces) - This allows the DMA controller to transfer a byte (or word) between memory and device every bus cycle if desired ## **DMA Operation (Block Transfer Mode)** As an example, consider reading a block from disk (i.e., I/O device): Processor tells the Device Interface the address of block on disk. Processor tells the DMA Controller the address and the size of block in memory. Device Interface reads block from disk to its internal buffer, while CPU continues normally. Once the device interface completes reading the block, it asserts REQ to DMA Controller. DMA Controller asserts BR. Processor asserts BG. ## DMA Operation (Block Transfer Mode) DMA Controller puts memory address on Address Bus, asserts control signals to handshake with memory, assert Read and GO to Device Interface Device Interface outputs byte of data from its buffer to Data Bus Memory uses the data and address on the bus to store data DMA Controller de-asserts GO, and Device Interface disconnects from Data Bus Repeat Step 7 to 10 until whole block transferred. DMA Controller de-asserts BR and the DMAC to DiskInterface, R/W processor regains control of bus DMA Controller sends an interrupt to Processor ## DMA Operation (Block Transfer Mode) #### As an example, consider reading a block from disk: processor fetches the instruction that loads the starting address of the block in the memory to the DMA controller register and, the instruction that loads the starting address on the Disk. processor fetches the instruction that loads the maximum wordcount (i.e. the number of bytes to be transferred) to the DMA controller register The DMA controller reads the specified number of bytes processor goes on and executes other tasks. processor receives the interrupt and do something. processor asserts a BG signal to the DMA controller. The DMA controller sends an interrupt signal to indicate completion. from memory and transfers it to the Disk. # DMA – Why The reason to use DMA are block ziza - Higher performance: since it requires only one interrupt per N-bytes transfer to check the device's readiness rather than one interrupt per byte (as in interrupt I/O). - Higher performance: executing N move instructions to transfer N-bytes of data is slow compared to stealing a cycle from the bus and doing the transfer - Used for slower devices that requires a large number of block transfer such as disk, tapes, etc ## DMA in STM32 Figure 49. DMA block diagram in low-, medium- high- and XL-density devices - The DMA2 controller is available only in high-density and XL-density devices. - 1. ADC3, SPI/I2S3, UART4, SDIO, TIM5, TIM6, DAC, TIM7, TIM8 DMA requests are available only in high- RCC: Power save mode: Switch off on-chip peripherals by removing access to their master clocks. ## DMA in STM32 Figure 50. DMA1 request mapping Must ENABLE the channel. Table 78. Summary of DMA1 requests for each channel | Peripherals | Channel 1 | Channel 2 | Channel 3 | Channel 4 | Channel 5 | Channel 6 | Channel 7 | |----------------------|-----------|-----------|---------------------|-----------------------------------|------------------|-----------------------|----------------------| | ADC1 | ADC1 | | | | | | | | SPI/I <sup>2</sup> S | | SPI1_RX | SPI1_TX | SPI2/I2S2_R<br>X | SPI2/I2S2_T<br>X | | | | USART | | USART3_TX | USART3_RX | USART1_TX | USART1_RX | USART2_RX | USART2_TX | | I <sup>2</sup> C | | | | I2C2_TX | I2C2_RX | I2C1_TX | I2C1_RX | | TIM1 | | TIM1_CH1 | TIM1_CH2 | TIM1_CH4<br>TIM1_TRIG<br>TIM1_COM | TIM1_UP | TIM1_CH3 | | | TIM2 | TIM2_CH3 | TIM2_UP | | | TIM2_CH1 | | TIM2_CH2<br>TIM2_CH4 | | TIM3 | | TIM3_CH3 | TIM3_CH4<br>TIM3_UP | | | TIM3_CH1<br>TIM3_TRIG | | | TIM4 | TIM4_CH1 | | | TIM4_CH2 | TIM4_CH3 | | TIM4_UP | ## In-class activities (Question 1-2) #### For Android devices, search HKUST iLearn at Play Store. #### For iOS devices, search **HKUST iLearn** at App Store. ## A project demo on DMA application https://www.youtube.com/watch?v=q50A47aAI9M This test was taking 84.326 seconds no DMA and no Optimization. It took only 33.71 seconds with DMA+optimization. ELEC 3300: Fall 2021 ## Reflection (Self-evaluation) - Do you .... - Distinguish three different I/O modules and their operations? - List some examples of applying different I/O in modern computer ? - List the hardware configurations of DMA? - Understand the operations of block transfer mode of DMA?